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Relations and Partial Orders 

A relation is a mathematical tool for describing associations between elements of 
sets. Relations are widely used in computer science, especially in databases and 
scheduling applications. A relation can be defined across many items in many sets, 
but in this text, we will focus on binary relations, which represent an association 
between two items in one or two sets. 



7.1 Binary Relations 

7.1.1 Definitions and Examples 

Definition 7.1.1. Given sets A and B, a binary relation R : A ^ B from^ A 
to B is a subset of ^ x fi. The sets A and B are called the domain and codomain 
of R, respectively. We commonly use the notation aRb or a b to denote that 
{a,b) e R. 

A relation is similar to a function. In fact, every function / : ^ ^ S is a rela- 
tion. In general, the difference between a function and a relation is that a relation 
might associate multiple elements ofB with a single element of^, whereas a func- 
tion can only associate at most one element of B (namely, / (a)) with each element 
a € A. 

We have already encountered examples of relations in earher chapters. For ex- 
ample, in Section 5.2, we talked about a relation between the set of men and the 
set of women where mRw if man m Ukes woman w. In Section 5.3, we talked 
about a relation on the set of MIT courses where ci Rc2 if the exams for ci and C2 
cannot be given at the same time. In Section 6.3, we talked about a relation on the 
set of switches in a network where si Rs2 if 5i and S2 are directly connected by a 
wire that can send a packet from sito S2. We did not use the formal definition of a 
relation in any of these cases, but they are all examples of relations. 

As another example, we can define an "in-charge-of" relation T from the set of 
MIT faculty F to the set of subjects in the 2010 MIT course catalog. This relation 
contains pairs of the form 

((instructor-name) , (subject-num)) 
^We also say that the relationship is between A and B, or on Aii B = A. 
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Figure 7,1 Some items in the "in-charge-of" relation T between faculty and sub- 
ject numbers. 

where the faculty member named (instructor-name) is in charge of the subject with 
number (subject-num). So T contains pairs like those shown in Figure 7.1. 

This is a surprisingly complicated relation: Meyer is in charge of subjects with 
three numbers. Leighton is also in charge of subjects with two of these three 
numbers — ^because the same subject. Mathematics for Computer Science, has two 
numbers (6.042 and 18.062) and Meyer and Leighton are jointly in-charge-of the 
subject. Freeman is in-charge-of even more subjects numbers (around 20), since 
as Department Education Officer, he is in charge of whole blocks of special sub- 
ject numbers. Some subjects, like 6.844 and 6.00 have only one person in-charge. 
Some faculty, Uke Guttag, are in-charge-of only one subject number, and no one 
else is jointly in-charge-of his subject, 6.00. 

Some subjects in the codomain, A'^, do not appear in the list — that is, they are 
not an element of any of the pairs in the graph of T\ these are the Fall term only 
subjects. Similarly, there are faculty in the domain, F , who do not appear in the 
list because all their in-charge-of subjects are Fall term only. 

7.1.2 Representation as a Bipartite Graph 

Every relation R . A ^ B can be easily represented as a bipartite graph G = 
(F, E) by creating a "left" node for each element of A and a "right" node for each 
element of B. We then create an edge between a left node u and a right node v 
whenever aRh. Similarly, every bipartite graph (and every partition of the nodes 
into a "left" and "right" set for which no edge connects a pair of left nodes or a pair 
of right nodes) determines a relation between the nodes on the left and the nodes 
on the right. 
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7.1. Binary Relations 

6.042 
18.062 
6.844 
6.011 
6.881 
6.882 
6.UAT 
6.00 

Figure 7.2 Part of the bipartite graph for the "in charge of relation T from Fig- 
ure 7.1. 

For example, we have shown part of the bipartite graph for the in-charge-of 
relation from Figure 7.1 in Figure 7.2. In this case, there is an edge between 
(instructor-name) and (subject-number) if (instructor-name) is in charge of (subject-number). 

A relation R . A ^ B between finite sets can also be represented as a matrix 
A = {uij } where 

1 if the /th element of A is related to the 7 th element of B 
otherwise 

for 1 < / < I y4 1 and 1 < j < \B\. For example, the matrix for the relation in 
Figure 7.2 (but restricted to the five faculty and eight subject numbers shown in 
Figure 7.2, ordering them as they appear top-to-bottom in Figure 7.2) is shown in 
Figure 7.3. 

7.1.3 Relational Images 

The idea of the image of a set under a function extends directly to relations. 
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Figure 7.3 The matrix for the "in charge of" relation T restricted to the five 
faculty and eight subject numbers shown in Figure 7.2. The (3, 4) entry of this 
matrix is 1 since the third professor (Freeman) is in charge of the fourth subject 
number (6.011). 

Definition 7.1.2. The image of a set Y under a relation R : A ^ B, written R(Y), 
is the set of elements that are related to some element in Y, namely, 

R{Y) ::= { € S I yRb for some y eY}. 

The image of the domain, R(A), is called the range of R. 

For example, to find the subject numbers that Meyer is in charge of, we can look 
for all the pairs of the form 

(Meyer, (subject-number)) 

in the graph of the teaching relation T, and then just list the right-hand sides of these 
pairs. These right-hand sides are exactly the image T (Meyer), which happens to 

be {6.042, 18.062, 6.844}. Similarly, since the domain F is the set of all in-charge 
faculty, T{F), the range of T, is exactly the set of all subjects being taught. 

7.1.4 Inverse Relations and Images 

Definition 7.1.3. The inverse R~^ of a relation R . A ^ B is the relation from B 
to A defined by the rule 

bR~^a if and only if aRb. 

The image of a set under the relation is called the inverse image of the set. 
That is, the inverse image of a set X under the relation R is R~^ (X). 

Continuing with the in-charge-of example above, we can find the faculty in 
charge of 6.UAT by taking the pairs of the form 

((instructor-name) , 6.UAT) 
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7.2. Relations and Cardinality 

for the teaching relation T, and then just listing the left-hand sides of these pairs; 
these turn out to be just Eng and Freeman. These left-hand sides are exactly the 
inverse image of {6.UAT} under T. 

7.1.5 Combining Relations 

There are at least two natural ways to combine relations to form new relations. For 

example, given relations R : B ^ C and S : A B, the composition of R with S 
is the relation (R o S) . A ^ C defined by the rule 

a(Ro S)c IFF 3b € B. (bRc) and (aSb) 

where a e A and c e C. 

As a special case, the composition of two functions f . B ^ C and g : A ^ B 
is the function / o g : A ^ C defined by 

(f o g)ia) = figia)) 

for all a e A. For example, if A = B = C = M., g(x) = x -I- 1 and f(x) = x^, 
then 

(fog)(x) = (x + lf 

= x^ + 2x + I. 

One can also define the product of two relations Ri : Ai ^ B\ and R2'. A2^ 
B2 to be the relation S = Ri x R2 where 

S : Ai X A2 ^ Bi X B2 

and 

(ai,a2)S(bi,b2) iff and a2^2^2- 



7.2 Relations and Cardinality 

7.2.1 Surjective and Injective Relations 

There are some properties of relations that will be useful when we take up the topic 
of counting in Part 111 because they imply certain relations between the sizes of 
domains and codomains. In particular, we say that a binary relation R : A ^ B is 

• surjective if every element of B is assigned to at least one element of A. 
More concisely, R is surjective iff R{A) = B (that is, if the range of R is the 
codomain of R), 
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• total when every element of A is assigned to some element of B. More 
concisely, R is total iff ^ = R~^(B), 

• infective if every element of B is mapped at most once, and 

• bijective if R is total, surjective, injective, and a function-^. 

We can illustrate these properties of a relation R : A ^ B in terms of the cor- 
responding bipartite graph G for the relation, where nodes on the left side of G 
correspond to elements of A and nodes on the right side of G correspond to ele- 
ments of B. For example: 

• is a function" means that every node on the left is incident to at most one 
edge. 

• "R is total" means that every node on the left is incident to at least one edge. 
So if is a function, being total means that every node on the left is incident 
to exactly one edge. 

• is surjective" means that every node on the right is incident to at least one 
edge. 

• "/? is injective" means that every node on the right is incident to at most one 
edge. 

• "i? is bijective" means that every node on both sides is incident to precisely 
one edge (that is, there is a perfect matching between A and B). 

For example, consider the relations Ri and R2 shown in Figure 7.4. /?i is a total 
surjective function (every node in the left column is incident to exactly one edge, 
and every node in the right column is incident to at least one edge), but not injective 
(node 3 is incident to 2 edges). R2 is a total injective function (every node in the 
left column is incident to exactly one edge, and every node in the right column 
is incident to at most one edge), but not surjective (node 4 is not incident to any 
edges). 

Notice that we need to know what the domain is to determine whether a relation 
is total, and we need to know the codomain to determine whether it's surjective. 
For example, the function defined by the formula 1 /x^ is total if its domain is M"*" 
but partial if its domain is some set of real numbers that includes 0. It is bijective 
if its domain and codomain are both M"*", but neither injective nor surjective it is 
domain and codomain are both M. 

■^These words surjective, injective, and bijective are not very memorable. Some authors use the 
possibly more memorable phrases onto for surjective, one-to-one for injective, and exact correspon- 
dence for bijective. 
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is shown in (b). 
7.2.2 Cardinality 

The relational properties in Section 7.2. 1 are useful in figuring out the relative sizes 
of domains and codomains. 

If ^ is a finite set, we use |^| to denote the number of elements in A. This is 
called the cardinality of ^. In general, a finite set may have no elements (the empty 
set), or one element, or two elements, . . . , or any nonnegative integer number of 
elements, so for any finite set, \A\ e N. 

Now suppose R : A B is a function. Then every edge in the bipartite 
graph G = (V, E) for R is incident to exactly one element of A, so the num- 
ber of edges is at most the number of elements of A. That is, if /? is a function, 
then 

1^1 < Ml- 

Similarly, if R is surjective, then every element of B is incident to an edge, so there 
must be at least as many edges in the graph as the size of B. That is 

1^1 > 

Combining these inequalities implies that R . A ^ B ha surjective function, then 
1^1 > \B\. This fact and two similar rules relating domain and codomain size to 
relational properties are captured in the following theorem. 
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Theorem 7.2.1 (Mapping Rules). Let A and B be finite sets. 

1. If there is a surjectionfrom A to B, then \A\ > 

2. If there is an injection from A to B, then \A\ < 

3. If there is a bijection between A and B, then \A\ = \B\. 

Mapping rule 2 can be explained by the same kind of reasoning we used for 
rule 1. Rule 3 is an immediate consequence of the first two mapping rules. 

We will see many examples where Theorem 7.2.1 is used to determine the car- 
dinaUty of a finite set. Later, in Chapter 13, we will consider the case when the 
sets are infinite and we'll use surjective and injective relations to prove that some 
infinite sets are "bigger" than other infinite sets. 



7.3 Relations on One Set 

For the rest of this chapter, we are going to focus on relationships between elements 
of a single set; that is, relations from a set ^ to a set B where A = B. Thus, a 
relation on a set ^ is a subset R ^ A x A. Here are some examples: 

• Let ^ be a set of people and the relation R describe who Ukes whom: that is, 
(x, y) e Rif and only if x likes y. 

• Let ^ be a set of cities. Then we can define a relation R such that xRy if and 
only if there is a nonstop flight from city x to city y. 

• Let A — Ij and let xRy hold if and only if x = y (mod 5). 

• Let ^ = N and let xRy if and only if x | j. 

• Let ^ = N and let xRy if and only if x < y. 

The last examples clarify the reason for using xRy or x ~r y to indicate that the 
relation R holds between x and y: many common relations (<, <, =, |, =) are 
expressed with the relational symbol in the middle. 

7.3.1 Representation as a Digraph 

Every relation on a single set A can be modeled as a directed graph (albeit one 
that may contain loops). For example, the graph in Figure 7.5 describes the "likes" 
relation for a particular set of 3 people. 
In this case, we see that: 
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Bob 



Figure 7.5 The directed graph for the "hkes" relation on the set {Bill, Bob, Julie}. 




3 9 11 



Figure 7.6 The digraph for divisibility on { 1 , 2 12}. 

• Juhe likes Bill and Bob, but not herself. 

• Bill hkes only himself. 

• Bob likes Julie, but not Bill nor himself. 

Everything about the relationship is conveyed by the directed graph and nothing 
more. This is no coincidence; a set A together with a relation R is precisely the 
same thing as directed graph G = (V, E) with vertex set F = ^ and edge set 
E — R (where E may have loops). 

As another example, we have illustrated the directed graph for the divisibihty 
relationship on the set {1, 2, 12} in Figure 7.6. In this graph, every node has 
a loop (since every positive number divides itself) and the composite numbers are 
the nodes with indegree more than 1 (not counting the loop). 

Relations on a single set can also be represented as a 0, 1-matrix. In this case, 
the matrix is identical to the adjacency matrix for the corresponding digraph. For 
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example, the matrix for the relation shown in Figure 7.5 is simply 



1 1 

1 

1 



:) 



where v\= Julie, vi = Bill, and vs = Bob. 

7.3.2 Symmetry, Transitivity, and Other Special Properties 

Many relations on a single set that arise in practice possess one or more noteworthy 
properties. These properties are summarized in the box on the following page. In 
each case, we provide the formal of the definition of the property, explain what the 
property looks like in a digraph G for the relation, and give an example of what the 
property means for the "likes" relation. 

For example, the congruence relation modulo 5 on Z is reflexive symmetric, and 
transitive, but not irreflexive, antisymmetric, or asymmetric. The same is true for 
the "connected" relation R . V ^ V on an undirected graph G = (V, E) where 
uRv if u and v are in the same connected component of graph G. In fact, relations 
that have these three properties are so common that we give them a special name: 
equivalence relations. We will discuss them in greater detail in just a moment. 

As another example, the "divides" relation on Z"*" is reflexive, antisymmetric, 
and transitive, but not irreflexive, symmetric, or asymmetric. The same is true for 
the "<" relation on M. Relations that have these three properties are also very 
common and they fall into a special case of relations called a partial order. We will 
discuss partial orders at length in Sections 7.5-7.9. 

As a final example, consider the "likes" relation on the set {Julie, Bill, Bob} il- 
lustrated in Figure 7.5. This relation has none of the six properties described in the 
box. 



A relation is an equivalence relation if it is reflexive, symmetric, and transitive. 
Congruence modulo « is an excellent example of an equivalence relation: 

• It is reflexive because x = x (mod n). 

• It is symmetric because x = y (mod n) implies y = x (mod n). 

• It is transitive because x = y (mod n) and y = z (mod n) imply that x = z 



lA Equivalence Relations 



(mod n). 
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Properties of a Relation R : A ^ A 




Reflexivitv R is reflexive if 




Vx € A. xRX. 




"Everyone likes themselves." 




Every node in G has a loop. 




Irreflexivity R is irreflexive if 




-"3x e A. xRx. 




"N^o nnp liVp^ tlipm^plvp^ " 




There are no loops in G. 




Symmetry R is symmetric if 




Vx,_y G A. xRy IMPLIES yRx. 




11 A iijv^^l5 y J Liitii y iijv^^s Ay . 




If there is an edge from x to j in G, then there is an edge from j to x in G 
as well. 




Antisymmetry R is antisymmetric if 




Vx, y € A (xRy AND yRx) implies x = y. 








There is at most one directed edge between any pair of distinct nodes. 




Asymmetry R is asymmetric if 




— '3x, y E A. xRy and yRx. 




"No one likes themselves and no pair of people like each other." 




There are no loops and there is at most one directed edge between any pair 
of nodes. 




Transitivity R is transitive if 




yx,y,z e A. (xRy and yRz) implies xRz. 




"If X likes y and y likes z, then x likes z too." 




For any walk vo,vi, . . . ,VkmG where k >2, vq ^ is in G (and, hence, 
Vi — ^ Vj is also in G for all i < j . 
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There is an even more well-known example of an equivalence relation: equality 
itself. Thus, an equivalence relation is a relation that shares some key properties 
with "=". 

7.4.1 Partitions 

There is another way to think about equivalence relations, but we'll need a couple 
of definitions to understand this alternative perspective. 

Definition 7.4.1. Given an equivalence relation ^ : ^ ^ ^, the equivalence class 
of an element x e ^ is the set of all elements of A related to x by i?. The equiva- 
lence class of X is denoted [x\. Thus, in symbols: 

[x\ = {y\xRy}. 

For example, suppose that A = 'L and xRy means that x = y (mod 5). Then 

[7] = {...,-3,2,7, 12,22,...}. 

Notice that 7, 12, 17, etc., all have the same equivalence class; that is, [7] = [12] = 
[17] = .... 

Definition 7.4.2. A partition of a finite set ^ is a collection of disjoint, nonempty 

subsets Ai, A2, . .., A„ whose union is all of A. The subsets are usually called the 
blocks of the partition.-' For example, one possible partition of ^ = {a, b, c, d, e} 
is 

Ai = {a, c} A2 = {b, e} Aj = {d}. 

Here's the connection between all this stuff: there is an exact correspondence 
between equivalence relations on A and partitions of A. We can state this as a 
theorem: 

Theorem 7.4.3. The equivalence classes of an equivalence relation on a set A form 
a partition of A. 

We won't prove this theorem (too dull even for us !), but let's look at an example. 

^We think they should be called the parts of the partition. Don't you think that makes a lot more 
sense? 
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The congruent-mod-5 relation partitions the integers into five equivalence classes: 

{...,-5,0,5,10,15,20,...} 
{...,-4,1,6,11,16,21,...} 
{...,-3,2,7, 12,17,22,...} 
{...,-2,3,8,13,18,23,...} 
{...,-1,4,9,14,19,24,...} 

In these terms, x = y (mod 5) is equivalent to the assertion that x and y are both 
in the same block of this partition. For example, 6=16 (mod 5), because they're 
both in the second block, but 2^9 (mod 5) because 2 is in the third block while 
9 is in the last block. 

In social terms, if "likes" were an equivalence relation, then everyone would be 
partitioned into cliques of friends who all like each other and no one else. 



7.5 Partial Orders 

7.5.1 Strong and Weak Partial Orders 

Definition 7.5.1. A relation ^ on a set ^ is a weak partial order if it is transitive, 
antisymmetric, and reflexive. The relation is said to be a strong partial order if it 
is transitive, antisymmetric, and irreflexive.^ 

Some authors defined partial orders to be what we call weak partial orders, but 
we'll use the phrase partial order to mean either a weak or a strong partial order. 
The difference between a weak partial order and a strong one has to do with the 
reflexivity property: in a weak partial order, every element is related to itself, but in 
a strong partial order, no element is related to itself. Otherwise, they are the same 
in that they are both transitive and antisymmetric. 

Examples of weak partial orders include "<" on M, "c" on the set of subsets 
of (say) Z, and the "divides" relation on N"*". Examples of strict partial orders 
include "<" on M, and "C" on the set of subsets of Z.^ 

^Equivalently, the relation is transitive and asymmetric, but stating it this way might have obscured 
the irreflexivity property. 

^If you are not feeling comfortable with all the definitions that we've been throwing at you, it's 
probably a good idea to verify that each of these relations are indeed partial orders by checking that 
they have the transitivity and antisymmetry properties. 
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We often denote a weak partial order with a symbol such as :< or c instead of a 
letter such as R. This makes sense from one perspective since the symbols call to 
mind < and c, which define common partial orders. On the other hand, a partial 
order is really a set of related pairs of items, and so a letter like R would be more 
normal. 

Likewise, we will often use a symbol like k or iz to denote a strong partial order. 
7.5.2 Total Orders 

A partial order is "partial" because there can be two elements with no relation 
between them. For example, in the "divides" partial order on {1, 2, ... , 12}, there 
is no relation between 3 and 5 (since neither divides the other). 

In general, we say that two elements a and b are incomparable if neither a <b 
nor b <a. Otherwise, iia <b orb <a, then we say that a and b are comparable. 

Definition 7.5.2. A total order is a partial order in which every pair of distinct 
elements is comparable. 

For example, the "<" partial order on M is a total order because for any pair of 
real numbers x and y, either x < y ox y < x. The "divides" partial order on 
{1,2,..., 12} is not a total order because 3 I 5 and 5 | 3. 



7.6 Posets and DAGs 

7.6.1 Partially Ordered Sets 

Definition 7.6.1. Given a partial order :< on a set A, the pair :<) is called a 

partially ordered set or poset. 

In terms of graph theory, a poset is simply the directed graph G = (A,<) with 
vertex set A and edge set <. For example. Figure 7.6 shows the graph form of the 
poset for the "divides" relation on {1, 2, . . . , 12}. We have shown the graph form 
of the poset for the "<"-relation on {1, 2, 3, 4} in Figure 7.7. 

7.6.2 Posets Are Acyclic 

Did you notice anything that is common to Figures 7.6 and 7.7? Of course, they 
both exhibit the transitivity and antisymmetry properties. And, except for the loops 
in Figure 7.6, they both do not contain any cycles. This is not a coincidence. In fact, 
the combination of the transitivity and asymmetry properties imply that the digraph 
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Figure 7.7 Representing the poset for the "<"-relation on { 1 , 2, 3, 4} as a digraph. 

for any poset is an acyclic graph (that is, a DAG), at least if you don't count loops 
as cycles. We prove this fact in the following theorem. 

Theorem 7.6.2. A poset has no directed cycles other than self-loops. 

Proof. We use proof by contradiction. Let {A, <) be a poset. Suppose that there 
exist « > 2 distinct elements a\,a2, ...,an such that 

a\<a2<a-i<---< a„-i <an< a\. 

Since a\ < and a 2 ^ ^^3, transitivity implies a\ < a^. Another application 
of transitivity shows that ai < 04 and a routine induction argument establishes 
that a I < an- Since we know that fl„ < ai, antisymmetry implies ai — a„, 
contradicting the supposition that ai, ... , an axe distinct and n >2. Thus, there is 
no such directed cycle. ■ 

Thus, deleting the self-loops from a poset leaves a directed graph without cycles, 
which makes it a directed acyclic graph or DAG. 

7.6.3 Transitive Closure 

Theorem 7.6.2 tells us that every poset corresponds to a DAG. Is the reverse true? 
That is, does every DAG correspond to a poset? The answer is "Yes," but we 
need to modify the DAG to make sure that it satisfies the transitivity property. For 
example, consider the DAG shown in Figure 7.8. As any DAG must, this graph 
satisfies the antisymmetry property^ but it does not satisfy the transitivity property 
because vi ^ V2 and V2 vs are in the graph but ui ^ j;3 is not in the graph. 

Hfu->v and u ^ m are in a digraph G, then G would have a cycle of length 2 and it could not 
be a DAG. 
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Figure 7.8 A 6-node digraph that does not satisfy the transitivity property. 




Figure 7.9 The transitive closure for the digraph in Figure 7.8. The edges that 
were added to form the transitive closure are shown in bold. 

Definition 7.6.3. Given a digraph G = (V, E), the transitive closure of G is the 
digraph G"*" = (V,E'^) where 

E'^ = {u ^ V \ there is a directed path of positive length from m to u in G }. 

Similarly, if R is the relation corresponding to G, the transitive closure of R (de- 
noted R'^) is the relation corresponding to G"*". 

For example, the transitive closure for the graph in Figure 7.8 is shown in Fig- 
ure 7.9. 

If G is a DAG, then the transitive closure of G is a strong partial order. The proof 
of this fact is left as an exercise in the problem section. 

7.6.4 The Hasse Diagram 

One problem with viewing a poset as a digraph is that there tend to be lots of edges 
due to the transitivity property. Fortunately, we do not necessarily have to draw 
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all the edges if we know that the digraph corresponds to a poset. For example, 
we could choose not to draw any edge which would be implied by the transitivity 
property, knowing that it is really there by implication. In general, a Basse diagram 
for a poset (A, <) is a digraph with vertex set A and edge set :< minus all self-loops 
and edges implied by transitivity. For example, the Hasse diagrams of the posets 
shown in Figures 7.6 and 7.7 are shown in Figure 7.10. 



7.7 Topological Sort 

A total order that is consistent with a partial order is called a topological sort. More 
precisely. 

Definition 7.7.1. A topological sort of a poset (A, <) is a total order (A,<t) such 
that 

X < y IMPLIES X <T y. 

For example, consider the poset that describes how a guy might get dressed for 
a formal occasion. The Hasse diagram for such a poset is shown in Figure 7.11. 
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left sock right sock underwear shirt 




jacket 



Figure 7.11 The Hasse diagram for a poset that describes which items much pre- 
cede others when getting dressed. 

In this poset, the set is all the garments and the partial order specifies which items 
much precede others when getting dressed. 

There are several total orders that are consistent with the partial order shown in 
Figure 7.11. We have shown two of them in list form in Figure 7.12. Each such 
fist is a topological sort for the partial order in Figure 7.11. In what follows, we 
will prove that every finite poset has a topological sort. You can think of this as a 
mathematical proof that you can get dressed in the morning (and then show up for 
math lecture). 

Theorem 7.7.2. Every finite poset has a topological sort. 

We'll prove the theorem constructively. The basic idea is to pull the "smallest" 
element a out of the poset, find a topological sort of the remainder recursively, and 
then add a back into the topological sort as an element smaller than all the others. 

The first hurdle is that "smallest" is not such a simple concept in a set that is only 
partially ordered. In a poset {A, <), an element x ^ A\% minimal if there is no other 
element y e A such that y < x. For example, there are four minimal elements in 
the getting-dressed poset: left sock, right sock, underwear, and shirt. (It may seem 
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underwear 


left sock 


pants 


shirt 


belt 


tie 


shirt 


underwear 


tie 


right sock 


jacket 


pants 


left sock 


right shoe 


right sock 


belt 


left shoe 


jacket 


right shoe 


left shoe 


(a) 


(b) 



Figure 7.12 Two possible topological sorts of the poset shown in Figure 7. 1 1 . In 
each case, the elements are Usted so that x < y iff x is above y in the list. 

odd that the minimal elements are at the top of the Hasse diagram rather than the 
bottom. Some people adopt the opposite convention. If you're not sure whether 
minimal elements are on the top or bottom in a particular context, ask.) Similarly, 
an element x e Ais maximal if there is no other element y e A such that x < y. 

Proving that every poset has a minimal element is extremely difficult, because it 
is not true. For example, the poset (Z, <) has no minimal element. However, there 
is at least one minimal element in every finite poset. 

Lemma 7.7.3. Every finite poset has a minimal element. 

Proof. Let {A, -<) be an arbitrary poset. Let a\, a2, . . ., Unhe a maximum-length 
sequence of distinct elements in A such that 

The existence of such a maximum-length sequence follows from the Well Ordering 
Principle and the fact that A is finite. Now ao < ai cannot hold for any element 
a^i ^ A not in the chain, since the chain already has maximum length. And at ■< ai 
cannot hold for any / > 2, since that would imply a cycle 

cii ^ ai < a2 :< • • • :< at 

and no cycles exist in a poset by Theorem 7.6.2. Therefore a i is a minimal element. 

■ 

Now we're ready to prove Theorem 7.7.2, which says that every finite poset has a 
topological sort. The proof is rather intricate; understanding the argument requires 
a clear grasp of all the mathematical machinery related to posets and relations! 
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Proof of Theorem 7.7.2. We use induction. Let P{n) be the proposition that every 
« -element poset has a topological sort. 

Base case: Every 1 -element poset is already a total order and thus is its own topo- 
logical sort. So P(l) is true. 

Inductive step: Now we assume P{n) in order to prove P(n -\- 1) where n > 1. 
Let -<) be an (« -|- l)-element poset. By Lemma 7.7.3, there exists a minimal 
element \xva & A. Remove a and all pairs in -< involving a to obtain an ^-element 
poset (yM , :<'). This has a topological sort (^', -^j) by the assumption P{n). Now 
we construct a total order :<r) by adding a back as an element smaller than all 
the others. Formally, let 

<t=^'t \ z e A). 

All that remains is to check that this total order is consistent with the original partial 
order {A,<)\ that is, we must show that 

X <y IMPLIES X <T y- 

We assume that the left side is true and show that the right side follows. There are 

two cases. 

Case 1 If X = a, then a y holds, because a ^j- z for all z g A. 

Case 2 ii x ^ a, then y can not equal a either, since a is a minimal element in the 
partial order <. Thus, both x and y are in A' and so x <' y. This means 
X ^ J y, since -<'j is a topological sort of the partial order <' . And this 
implies x <t y since -<t contains -<'j. 

Thus, {A, <t) is a topological sort of {A, <). This shows that P{n) implies 
P(« -|- 1) for all « > 1. Therefore is true for all « > 1 by the principle of 
induction, which proves the theorem. ■ 



7.8 Parallel Task Scheduling 

When items of a poset are tasks that need to be done and the partial order is a 
precedence constraint, topological sorting provides us with a way to execute the 
tasks sequentially without violating the precedence constraints. 

But what if we have the ability to execute more than one task at the same time? 
For example, suppose that the tasks are programs, the partial order indicates data 
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Figure 7.13 A parallel schedule for the tasks-in-getting-dressed poset in Fig- 
ure 7.11. The tasks in Ai can be performed in step / for 1 < / < 4. A chain 
of length 4 (the critical path in this example) is shown with bold edges. 



dependence, and we have a parallel machine with lots of processors instead of a 
sequential machine with only one processor. How should we schedule the tasks so 
as to minimize the total time used? 

For simpUcity, assume all tasks take 1 unit of time and we have an unUmited 
number of identical processors. For example, in the clothes example in Figure 7.11, 
how long would it take to handle all the garments? 

In the first unit of time, we should do all minimal items, so we would put on our 
left sock, our right sock, our underwear, and our shirt.^ In the second unit of time, 
we should put on our pants and our tie. Note that we cannot put on our left or right 
shoe yet, since we have not yet put on our pants. In the third unit of time, we should 
put on our left shoe, our right shoe, and our belt. Finally, in the last unit of time, 
we can put on our jacket. This schedule is illustrated in Figure 7.13. 

The total time to do these tasks is 4 units. We cannot do better than 4 units of 

^ Yes, we know that you can't actually put on both socks at once, but imagine you are being dressed 
by a bunch of robot processors and you are in a big hurry. Still not working for you? Ok, forget about 
the clothes and imagine they are programs with the precedence constraints shown in Figure 7. 11. 
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time because there is a sequence of 4 tasks, each needing to be done before the 

next, of length 4. For example, we must put on our shirt before our pants, our pants 
before our belt, and our belt before our jacket. Such a sequence of items is known 
as a chain 

Definition 7.8.1. A chain is a sequence a\ ^ ai < ■ ■ ■ < at, where a, ^ aj for all 
i ^ j, such that each item is comparable to the next in the chain, and it is smaller 
with respect to :<. The length of the chain is t, the number of elements in the chain. 

Thus, the time it takes to schedule tasks, even with an unhmited number of pro- 
cessors, is at least the length of the longest chain. Indeed, if we used less time, then 
two items from a longest chain would have to be done at the same time, which con- 
tradicts the precedence constraints. For this reason, a longest chain is also known 
as a critical path. For example. Figure 7.13 shows the critical path for the getting- 
dressed poset. 

In this example, we were in fact able to schedule all the tasks in t steps, where 
t is the length of the longest chain. The really nice thing about posets is that this 
is always possible! In other words, for any poset, there is a legal parallel schedule 
that runs in t steps, where t is the length of the longest chain. 

There are lots of ways to prove this fact. Our proof will also give us the corre- 
sponding schedule in t time steps, and allow us to obtain some nice corollaries. 

Theorem 7.8.2. Given any finite poset {A, <) for which the longest chain has 
length t, it is possible to partition A into t subsets A\, A2, At such that for 
all i e {\,2, . . . ,t} and for all a € Aj, we have that all b < a appear in the set 
Ai U...UAi-i. 

Before proving this theorem, first note that for each /, all items in Ai can be 
scheduled in time step / . This is because all preceding tasks are scheduled in pre- 
ceding time steps, and thus are already completed. So the theorem implies that 

Corollary 7.8.3. The total amount of parallel time needed to complete the tasks is 
the same as the length of the longest chain. 

Proof of Theorem 7.8.2. For all a e Ai, put a in Aj, where / is the length of the 
longest chain ending at a. For example, the Ai for the getting-dressed poset are 
shown in Figure 7.13. In what follows, we show that for all for all a € Ai and 
for allb < a with b ^ a,we have b e AiU A2D . . .U Ai-i. 

We prove this by contradiction. Assume there is some a e Ai, and b < a 
with b ^ a and b ^ ^1 U ^2 U . . . U By the way we defined At, this implies 
there is a chain of length at least / ending at b. Since b < a and b a, we can 
extend this chain to a chain of length at least / + 1, ending at a. But then a could 
not be in Ai . This is a contradiction. ■ 
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7.9. Dilworth's Lemma 

If we have an unlimited number of processors, then the time to complete all tasks 
is equal to the length of the longest chain of dependent tasks. The case where there 
are only a limited number of processors is very useful in practice and it is covered 
in the Problems section. 



7.9 Dilworth's Lemma 

Definition 7.9.1. An antichain in a poset is a set of elements such that any two 
elements in the set are incomparable. 

For example, each Ai in the proof of Theorem 7.8.2 and in Figure 7.13 is an 
antichain since its elements have no dependencies between them (which is why 
they could be executed at the same time). 

Our conclusions about scheduUng also tell us something about antichains. 

Corollary 7.9.2. If the largest chain in a partial order on a set A is of size t, then 
A can be partitioned into t antichains. 

Proof. Let the antichains be the sets ^1, ^2, ••• defined in Theorem 7.8.2. ■ 

Corollary 7.9.2 implies a famous result^ about partially ordered sets: 

Lemma 7.9.3 (Dilworth). For all t > 0, every partially ordered set with n elements 
must have either a chain of size greater than t or an antichain of size at least n/t. 

Proof. By contradiction. Assume that the longest chain has length at most t and the 
longest antichain has size less than n/t. Then by Corollary 7.9.2, the n elements 
can be partitioned into at most t antichains. Hence, there are fewer than t-n/t = n 
elements in the poset, which is a contradiction. Hence there must be a chain longer 
than t or an antichain with at least n/t elements. ■ 

Corollary 7.9.4. Every partially ordered set with n elements has a chain of size 
greater than ^/n or an antichain of size at least ~Jn. 

Proof. Set ? = in Lemma 7.9.3. ■ 

As an appUcation, consider a permutation of the numbers from 1 to « arranged 
as a sequence from left to right on a line. Corollary 7.9.4 can be used to show 
that there must be a V"-length subsequence of these numbers that is completely 

^Lemma 7.9.3 also follows from a more general result known as Dilworth's Theorem that we will 
not discuss. 
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increasing or completely decreasing as you move from left to right. For example, 
the sequence 

7,8,9,4,5,6, 1,2,3 

has an increasing subsequence of length 3 (for example, 7, 8, 9) and a decreasing 
subsequence of length 3 (for example, 9, 6, 3). The proof of this result is left as an 
exercise that will test your ability to find the right partial order on the numbers in 
the sequence. 
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